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Abstract 

Influence maximization is a problem of finding a small set of highly influential users, also known 
as seeds, in a social network such that the spread of influence under certain propagation models is 
maximized. In this paper, we consider time-critical influence maximization, in which one wants to 
maximize influence spread within a given deadline. Since timing is considered in the optimization, we 
also extend the Independent Cascade (IC) model and the Linear Threshold (LT) model to incorporate 
the time delay aspect of influence diffusion among individuals in social networks. We show that time- 
critical influence maximization under the time-delayed IC and LT models maintains desired properties 
such as submodularity, which allows a greedy approximation algorithm to achieve an approximation ratio 
of 1 — 1/e. To overcome the inefficiency of the greedy algorithm, we design two heuristic algorithms: the 
first one is based on a dynamic programming procedure that computes exact influence in tree structures 
and directed acyclic subgraphs, while the second one converts the problem to one in the original models 
and then applies existing fast heuristic algorithms to it. Our simulation results demonstrate that our 
algorithms achieve the same level of influence spread as the greedy algorithm while running a few orders of 
magnitude faster, and they also outperform existing fast heuristics that disregard the deadline constraint 
and delays in diffusion. 
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2012). The copyright of that conference version is held by the Association for the Advancement of Artificial Intelligence (AAAI). 

^ Part of this work was done while Wei Lu was a visiting student at Microsoft Research Asia. Support in part by The Natural 
Sciences and Engineering Research Council of Canada (NSERC). 

■f-This work was done while Ning Zhang was a visiting student at Microsoft Research Asia. 



1 



1 Introduction 



Recently, the rapidly increasing popularity of online social networking sites such as Facebook, Twitter, and 
Google+ opens up great opportunities for large-scale viral marketing campaigns. Viral marketing, first 
introduced to the data mining community by Domingos and Richardson [5 , is a cost-effective marketing 
strategy that promotes products by giving free or discounted items to a selected group of highly influential 
individuals (seeds), in the hope that through the word-of- mouth effects, a large number of product adoption 
will occur. 

Motivated by viral marketing, influence maximization emerges as a fundamental data mining problem 
concerning the propagation of ideas, opinions, and innovations through social networks. In their seminal 
paper, Kempe et al. [10 formulated influence maximization as a problem in discrete optimization: Given 
a network graph G with pairwise user influence probabilities on edges, and a positive number /c, find k 
users, such that by activating them initially, the expected spread of influence is maximized under certain 
propagation models. Two classical propagation models studied in the the literature are the Independent 
Cascade (IC) and the Linear Threshold (LT) model. 

The family of models considered in Kempe et al. and its follow-ups, including IC and LT, do not fully 
incorporate important temporal aspects that have been well observed in the dynamics of influence diffusion. 
First, the propagation of influence from one person to another may incur a certain amount of time delay ^ 
which is evident from recent studies by statistical physicists on empirical social networks. Iribarren and 
Moro PI observed that the spread of influence in social networks slows down due to heterogeneity in human 
activities. Karsai et al. [9] reported similar observations and attributed such slow-down to the bursty nature 
of human interactions and the topological correlations in networks with the small- world property. 

Second, the spread of influence may be time-critical in practice. In a certain viral marketing campaign, 
it might be the case that the company wishes to trigger a large volume of product adoption in a fairly short 
time frame, e.g., a three-day sale. As a motivating example, let us suppose that Alice has bought an Xbox 
360 console and Kinect with a good discount, but the deal would only last for three days. Alice wanted to 
recommend this deal to Bob, but whether her recommendation would be effective depends on whether Alice 
and Bob can be in touch (e.g., meeting in person, or Bob seeing the message left by Alice on Facebook) 
before the discount expires. Therefore, when we try to maximize the spread of influence for a viral marketing 
campaign facing this kind of scenarios, we need to take both the time delay aspect of influence diffusion and 
the time-critical constraint of the campaign into consideration. 

To this end, we extend the influence maximization problem to have a deadline constraint to reflect the 
time-critical effect. We also propose two new propagation models, the Independent Cascade model with 
Meeting events (IC-M) and the Linear Threshold model with Meeting events (LT-M) to capture the delay of 
propagation in time. We show that both the IC-M and LT-M models maintain desired properties, namely 
monotonicity and submodularity, which implies a greedy (1 — 1/e)- approximation algorithm in spite of the 
NP-hardness of the problem. 

For the IC-M model, we design two efficient and effective heuristic algorithms, MIA-M and MIA-C, based 
on the notion of Maximum Influence Arborescence (MIA) [2 to tackle time-critical influence maximization. 
On the other hand, similarly for the LT-M model, we adapt the notion Local Directed Acyclic Graph 
(LDAG) proposed in [4 to obtain the LDAG-M heuristic algorithm. Our experiments evaluate performance 
of various algorithms for the IC-M model, including the greedy approximation algorithm, MIA-M, and MIA- 
C heuristics. The empirical results demonstrate that our heuristic algorithms produce seed sets with equally 
good quality as those mined by approximation algorithm, while being two to three orders of magnitude 
faster. Moreover, we show that only using standard heuristics such as MIA and disregarding time delays 
and deadline constraint could result in poor influence spread compared to our heuristics that are specifically 
designed for this context. 

1.1 Related Work 

Domingos and Richardson O [14] first posed influence maximization as an algorithmic problem. They 
modeled the problem using Markov random fields and proposed heuristic solutions. Kempe et al. studied 
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influence maximization as a discrete optimization problem. They showed that the problem is NP-hard 
under both the IC and LT models, and relied on submodularity to obtain a (1 — 1/e) greedy approximation 
algorithm. 

A number of studies following [10] developed more efficient and scalable solutions, including the cost- 
effective lazy forward (CELF) optimization [12] and also work by Kimura et al. [H], Chen et al. [3 , etc. 
Specifically for the IC model, Chen et al. [2] showed that it is #P-hard to compute the exact infiuence of 
any node set in general graphs. They proposed the MIA model which uses infiuence in local tree structures 
to approximate infiuence propagated through the entire network. They then developed scalable algorithms 
to compute exact infiuence in trees and mine seed sets with equally good quality as those found by the 
approximation algorithm. For the LT model, it is also #P-hard to compute the exact infiuence in general 
graphs [4]. To circumvent that, Chen et al. [4 proposed to use local directed acyclic graphs (LDAG) to 
approximate the infiuence regions of nodes. 

More recently, Chen et al. [1 extended the IC model to capture the propagation of negative opinions in 
information diffusion. Goyal, Bonchi, and Lakshmanan [7| leveraged real propagation traces to derive more 
accurate diffusion models. In [6 , Goyal et al. studied the problem of MINTIME in inffuence maximization. 
In MINTIME, an inffuence spread threshold r] and a budget threshold k are given, and the task is to find 
a seed set of size at most k such that by activating it, at least 77 nodes are activated in expectation in the 
minimum possible time. This problem also considers timing in infiuence maximization, but the number 
of time steps in MINTIME is the optimization objective, whereas in our case, time is given as a deadline 
constraint. 

The time-delay phenomena in information diffusion has been explored in statistical physics. Iribarren 
and Moro [^ observed from a large-scale Internet viral marketing experiment in Europe that the dynamics 
of information diffusion are controlled by the heterogeneity of human activities. More recently, using time- 
stamped phone call records, Karsai et al. [9] found that the spreading speed of information on social networks 
is much slower than one may expect, due to various kinds of correlations, such as community structures in 
the graph, weight-topology correlations, and bursty event on single edges. 

2 Time-Critical Influence Maximization for Time-Delayed Inde- 
pendent Cascade Model 

2.1 Model and Problem Definition 

We first describe the standard Independent Cascade (IC) model in Kempe et al. p!0|, and then show our 
extensions that incorporate deadline and random meeting events. In the IC model, a social network is 
modeled as a directed graph G = (V^E)^ where V is the set of nodes representing users and E is the set of 
directed edges representing links (relationship) between users. Each edge {u, v) e E is associated with an 
influence probability p{u^ v) defined by function p : E ^ [0^1]. If (li, v) E^ define p{u^ v) = 0. 

The diffusion process under the IC model proceeds in discrete time steps 0, 1, 2, ... . Initially, a seed set 
C V is targeted and activated at step 0, while all other nodes are inactive. At any step t > 1, any node 
u activated at step t — 1 is given a single chance to activate any of its currently inactivate neighbors v with 
independent success probability p{u, v). Once a node is activated, it stays active. The process continues until 
no new nodes can be activated. The influence maximization problem under the IC model is to find a seed 
set S with at most k nodes such that the expected number of activated nodes after the diffusion terminates, 
called influence spread and denoted by cr(6'), is maximized. 

We now describe our extension to the IC model to incorporate time-delayed influence diffusion, which we 
denote by IC-M (for Independent Cascade with Meeting events). In the IC-M model, each edge (u^v) G E 
is also associated with a meeting probability m{u, v) defined by function m : E ^ [0^1] (if {u^ v) ^ E, 
m{u^v) = 0). As in IC, a seed set S is targeted and activated at step 0. At any step t > 1, an active node 
u meets any of its currently inactive neighbors v independently with probability m(ii, v). If a meeting event 
occurs between u and v for the first time, u is given a single chance to try activating with an independent 
success probability p{u^v). If the attempt succeeds, v becomes active at step t and will start propagating 
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influence at t-\-l. The diffusion process quiesces when all active nodes have met with all their neighbors and 
no new nodes can be activated. 

Several possibilities can be considered in mapping the meeting events in the IC-M model to real actions 
in online social networks. For instance, a user u on Facebook posting a message on her friend v's wall can 
be considered as a meeting event. In the IC-M model, the meeting probabilities are not necessarily the same 
for different pair of users. Different pairs of friends may have different frequencies of exchanging messages 
on each other's walls, which is reflected by the meeting probability. 

Note that the original IC model is a special case of IC-M with m{u, v) = 1 for all edges {u, v) G E. 
More importantly, for the original influence maximization problem, the meeting probability is not essential, 
because as long as m{u^ v) > 0, eventually u will meet with v and try to influence v once. Thus, if we only 
consider the overall influence in the entire run, there would be no need to introduce meeting probabilities. 
However, if we consider influence within a deadline constraint^ then meeting probability is an important 
factor in determining the optimal seed set. 

Formally, for a deadline r G Z+, we deflne (7^- : 2^ ^ to be the set function such that (Jr{S) with 
C V is the expected number of activated nodes by the end of time step r under the IC-M model, with 
S as the seed set. The time-critical influence maximization with a deadline constraint r is the problem of 
flnding the seed set S with at most k seeds such that the expected number of activated nodes by step r is 
maximized, i.e., flnding 6** = arg max5cy,|5|<fc ^r(5')- 

Note that the original influence maximization problem is NP-hard for the IC model [10] and that problem 
is a special case of time-critical influence maximization for the IC-M model with all m(ix, v) = 1 and deadline 
constraint r = \V\. This leads to the following hardness result. 

Theorem 1. The time-critical influence maximization problem is NP-hard for the IC-M model. 
2.2 Properties of the IC-M Model 

Although to flnd the optimal solution for time-critical influence maximization with deadline r under IC-M 
is NP-hard (Theorem [T]), we show that the influence function o-r{-) is monotone and submodular, which 
allows a hill-climbing-style greedy algorithm to achieve a (1 — 1/e) -approximation to the optimal. 

Given a ground set [/, a set function /: 2^ ^ R is monotone if f{Si) < f{S2) whenever Si C S2- 
Also, the function is submodular if f{Si U {w}) - f{Si) > f{S2 U {w}) - /(^s), V5'i C ^2, G /7 \ S2. 
Submodularity captures the law of diminishing marginal returns, a well-known principle in economics. 

Theorem 2. The influence function o-r{-) is monotone and submodular for an arbitrary instance of the 
IC-M model, given any deadline constraint r > 1. 

To prove the theorem, we can view the random cascade process under IC-M using the "possible world" 
semantics and the principle of deferred decisions. That is, we can suppose that before the cascade starts, 
a set of outcomes for all meeting events, as well as the "live-or-blocked" identity for all edges are already 
determined but not yet revealed. 

More speciflcally, for each meeting event (a {u^v) pair and a time step t G [1,t]), we flip a coin with 
bias m(ii, v) to determine if u will meet v at t. Similarly, for each edge {u^ v) G we flip once with bias 
p{u^v)^ and we declare the edge "live" with probability p{u^v)^ or "blocked" with probability 1 — p{u^v). 
All coin-flips are independent. The identity of the edge (ix, v) is revealed in the event that u is active and is 
meeting the inactive v for the flrst time. Therefore, a certain set of outcomes of all coin flips corresponds to 
one possible worlds denoted by X, which is a deterministic graph (with all blocked edges removed) obtained 
by conditioning on that particular set of outcomes. 

Now we prove Theorem [2j 

Proof of Theorem^ Fix a set Xm of outcomes of all meeting events (V(2x, v) G E^yt G [0, r]), and also a set 
Xe of live-or-blocked identities for all edges. Since the coin-flips for meeting events and those for live-edge 
selections are orthogonal, and all flips are independent, any Xe on top of an Xm leads to a possible world 
X. 
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Next, we define the notion of "reachability" in X. Consider a hve edge iu^v) in X. Traditionally, without 
meeting events, v is reachable from u via just one hop. Now with pre-determined meeting sequences, v is 
reachable from u via ty — tu hops, where tu is the step in which u itself is reached, and ty is the first step 
when u meets after t^- Hence, we say that v is reachable from a seed set S if and only if (1) there exists 
at least one path consisting entirely of live edges (called live-path) from some node in 6* to f , and (2) the 
collective number of hops along the shortest live-path from 5 to is no greater than r. 

Then, let (S) be the number of nodes reachable from S in X (by the reachability definition above). 
Let Si and 5*2 be two arbitrary sets such that Si C S2 ^ V, and let node w e V \ S2 he arbitrary. The 
monotonicity of (t^ {■) holds, since if some node u can be reached by S^i, the source of the live-path to u 
must be also in 5*2. As for submodularity, consider a certain node u which is reachable from S2^ {w} but 
not from S2- This implies (1) u is not reachable from either, and (2) the source of the live-path to u 
must be w. Hence, u is reachable from S^i U {w} but not from S^i. This gives {Si U {w}) — (Si) > 
a^iS2U{w})-a^iS2). 

Let £1 denote the event that / is the true realization (virtually) of the corresponding random process. 
Taking the expectation over all possible worlds, we have ar{S) = Fy[£x] • cr^{S)yS C y, where X is 
any combination of X^; and Xm, and Pr[fx] = Pi"[fx£;] • Pi"[^Xm]- Therefore, cr(-) is a nonnegative linear 
combination of monotone and submodular functions, which is also monotone and submodular. This was to 
be shown. □ 



3 Time-Critical Influence Maximization under Time-Delayed Lin- 
ear Threshold Model 

3.1 Model and Problem Definition 

In the LT model, a social network is modeled as a directed graph G = {V^E), where V is the set of 
nodes representing individuals and E is the set of directed edges representing links (relationships, ties, etc.) 
between individuals. Each edge (i^, v) G E is associated with an influence weight b{u, v) defined by function 
b: E ^ [0,1]. We require that ^y^b{u,v) < 1, Wv e V. If {u,v) E, define b{u,v) = 0. On the other 
hand, each node v chooses a threshold 6y uniformly at random from the interval [0, 1], which represents the 
weighted fraction of v^s in-neighbors that must be active so as to let v become active. 

Given the random choices of node thresholds, and a seed set S CV, the dynamics of the diffusion process 
under the LT model proceed in discrete time steps 0, 1, 2, ... . Initially, at step 0, nodes in S are activated. 
Then at any step t > 1, nodes who have been active in previous steps remain active, and we activate any 
node V such that its threshold Oy is surpassed by the total weights of its currently active in-neighbors. That 

Sactive ueAf^''{v) ^{^^^) ^ The process continues until no new nodes can be activated. The influence 
maximization problem under the LT model is to find a seed set S with IS*! < k such that the expected 
number of activated nodes after the diffusion process terminates, called influence spread and denoted by 
cr(S'), is maximized. 

We now describe our extension to the LT model that incorporates time-delayed diffusion processes, which 
we call LT-M (Linear Threshold model with Meeting events). In the LT-M model, each edge {u,v) ^ E is 
also associated with a meeting probability m{u,v) defined by function m: E ^ [0, !]• Note that m{u,v) = 
if (1/, v) E. Same as in LT, nodes choose a uniform random number out of [0, 1] as threshold, and a seed 
set S is targeted and activated at step 0. 

At any step t > 1, each active node u remains active and it meets any of its currently inactive out- 
neighbors V with probability m{u,v), independently. If since u's activation, u and v meet for the first time 
at t, then we say u's infiuence weight to v is effective. An inactive v becomes active if the total effective 
weight from its active in-neighbors is at least Oy. That is, 

^ b{U,V) > Oy, 
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where M^'^{v) C M^'^{v) denotes the set of effective in-neighbor of v. After v becomes active, it starts to 
meet her neighbors from the next time step. The process quiesces when ah active nodes have met with their 
out-neighbors and no new nodes can be activated. 

Note that the original influence maximization problem is NP-hard for the LT model [10] and that is a 
special case of time-critical influence maximization under LT-M with all m{u^v) = 1 and deadline r = \V\. 
This leads us to the following hardness result. 

Theorem 3. The time-critical influence maximization problem is NP-hard for the LT-M model. 



3.2 Properties of the LT-M Model 

Theorem 4. The influence function (Jt{-) is monotone and submodular for an arbitrary instance of the 
LT-M model, given any deadline constraint r > 1. 

To prove Theorem [4j let us first give an alternative form of the model definition for LT-M. The main idea 
is to show that the diffusion process guided by LT-M is equivalent to one guided by a random "live-edge" 
selection process. Recall that in LT-M, each edge (u^v) has an weight b{u^v)^ and ^ub{u^v) < 1 for all 
V. The random edge selection protocol lets v pick at most one incoming edge (and the corresponding in- 
neighbor) independently at random, selecting a particular u with probability b{u^ v), or selecting no one with 
probability 1 — '^ub{u^v). If v selects then we declare edge (u^v) "live"; otherwise declare it "blocked". 
Kempe, Kleinberg and Tardos [10 showed that the distribution over the final active node sets obtained by 
running the Linear Threshold process is equivalent to that by running the above random live-edge selection 
process. 

In our case, it is also necessary to incorporate the random meeting events in LT-M into the "live-edge" 
(LE) process and obtain the "LE-M" model. Specifically, now, at any step t, we activate any inactive v 
provided that its selected neighbor u was activated at some earlier step t^ < t, and since + 1, they meet 
for the first time here at t. 

Lemma 1. The Linear Threshold model incorporated with random meeting events is equivalent to the live- 
edge model incorporated with random meeting events. In other worlds, the distribution over the final active 
sets under the LT-M model is the same as the one under the LE-M model. 

Proof. Let At{v) be the set of nodes that are already active and have met v at least once by the end of 
step t. For the linear threshold process, if some v is not yet active by the end of step t, the chance that 
V will become active at the next step, t + 1, is the probability that the incremental weight contributed by 
At{v) \ At-i{v) manages to push the total effective weight as of t + 1 over Oy. Denote this probability by 
Pi{v,t^l). 

For the live-edge process, we analyze the probability of the same event, that is, an inactive v will be 
activated at t + 1, given that it has not been active yet by the end of t. Let Si be the event that the 
corresponding node u of the live-edge v selected becomes active at t, and u meets v right away at t — 1. Let 
£2 be the event that this selected u has become active before t, but u and v have not met until t -\- 1. The 
probability that we want equals to Pr[fi U £2]. Notice that since v selects at most one live-edge, £1 and £2 
are mutually exclusive. Let this probability be P2{v^t + 1). 

What remains to be shown is that Pi{v^t + 1) = P2{v^t + 1). Notice that the meeting events are all 
independent and orthogonal to the live-edge selection or the linear threshold process. Thus, by the principle 
of deferred decision, we can treat these meeting events as if their outcomes were determined before the 
diffusion starts, but were only going to get revealed as the process proceeds. Without loss of generality, we 
fixed a set M of outcomes, by independently flipping a coin with bias m{u^ v)^ V(ii, v) G E in all time steps. 

Conditioning on the fixed M, for any v G V and tG{l,...,r — 1}, we have 

t+i) = pi^iv, t+i) = ^"^(^(^)\^^-.(^))^("';) . 
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Then, by un-conditioning, i.e., taking the expectation over ah possible outcomes of meeting events, we have 
Pi{v,t + 1) = P2{v^t + 1). This completes the proof. 

□ 

With Lemma [l] we can apply the same argument used in the submodularity proof for the IC-M model, 
to prove Theorem [4j The flow of the proof and the techniques used in it is similar to the proof of Theorem [2] 
We first run the live-edge selection random process and flip coins for all meeting events. Conditioning on 
a fixed set of meeting events outcomes, and removing all edges but the live ones, we obtain a deterministic 
graph (possible world). Then following the same arguments used in the proof of Theorem [2] leads us to the 
submodularity of the influence function cTt^-) for the LT-M model. 

Approximation Guarantees. Thus far, we have shown that the influence function cTt^-) is monotone 
and submodular under both IC-M and LT-M models, then our time-critical influence maximization problem 
is a special case of monotone submodular function maximization subject to a cardinality (uniform matroid) 
constraint. Therefore, we can apply the celebrated result in Nemhauser et al. [13 to obtain a greedy (1 — 1/e)- 
approximation algorithm. The greedy algorithm repeatedly grows S by adding u with the largest marginal 
influence w.r.t S in each iteration until l^l = k. 

Generalizing to the Triggering Set ModeL So far we have obtained a constant factor approximation 
algorithm for the time-critical influence maximization under both IC-M and LT-M models (Theorem |2] and 
Theorem |4|. In fact, this result can be applied to a more general time-delayed propagation model called the 
Triggering Set model with Meeting events (TS-M). 

In the original Triggering Set (TS) model flO], each node v eV independently chooses a random triggering 
set T{v) C A/'*^('u), according to some distribution over all subsets of JV^^{v). During the influence diffusion 
process guided by the TS model, a node v becomes active at step t if there is some u G T{v) that is active 
at step t — 1. Viewing the process using the live-edge model, we declare the edge {u^v) live if u is selected 
into the triggering set of v; otherwise it is declared blocked. 

The Triggering Set model generalizes IC and LT for the following reasons. Recall that for IC, each 
edge (u^v) in the graph is live with an independent probability p{u^v). Thus, for any we can view the 
triggering set selection process as v adds each in-neighbor u independently. Meanwhile, for LT, recall that it 
is equivalent to the live-edge selection process, which amounts to saying that v picks at most one in- neighbor 
u into its triggering set with probability b{u^ v)^ and picks an empty set with probability 1 — b{u^ v). 

The meeting events are orthogonal and independent from the triggering set selection process, and thus 
the TS-M model still generalizes the IC-M and LT-M models. Using the same arguments in the proofs for 
Theorem [2] and Theorem [4| the influence function (7r{') is monotone and submodular for every instance of 
the TS-M model as well, and thus this general model also enjoys the approximation guarantees provided by 
the greedy algorithm. 

Inefficiency of the Greedy Approximation Algorithm. Although the greedy approximation algo- 
rithm, it is #P-hard to compute the exact influence in general graphs for the IC and LT models [2l|4]. The 
hardness applies to IC-M and LT-M, since each of them subsumes corresponding original model. A common 
practice is to estimate influence spread using Monte-Carlo (MC) simulations, in which case the approxima- 
tion ratio of Greedy drops to 1 — 1/e — e, where e is small if the number of simulations is sufficiently large. 
Due to expensive simulations, the greedy algorithm is not scalable to large data, even the implementation 
can be accelerated by the CELF optimization ^12] . 

4 Computing Influence in Arborescences in the IC-M model 

In this section, we derive an dynamic programming algorithm that computes exact influence spread in tree 
structures, which will be used in Sec. [5] to develop MIA-M. The algorithmic problem of efficient computation 
of exact influence spread in trees in the IC-M model with deadline constraint is also of independent interest. 
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Algorithm 1: Greedy (G = {V,E), k, a^) 



1 5^0; 

2 for i = 1 ^ k do 

u ^ argmax^^y\^ [(Jr{S U {v}) - (Jr{S)\ ; 

s^syj{u}- 

5 Output 5*; 



An in-arborescence is a directed tree where all edges point into the root. Given a graph G = (V^E) 
with influence probability function p and meeting probability function m, consider an in-arborescence A = 
(Va^Ea) rooted at v where Va ^ V and Ea ^ E. We assume that influence propagates to v only from 
nodes in A. We also assume that there exists at least one s G S such that s G Va] otherwise no nodes can 
be activated in A. Given a seed set S and deadline r, we show how to compute ar{S) in A in time linear to 
the size of the graph. 

Let ap{u,t) be the activation probability of u at step i.e., the probability that u is activated at step t 
after the cascade ends in A. Since the events that u gets activated at different steps are mutually exclusive, 
the probability that u ever becomes active by the end of step r is Xlt=o 1). By linearity of expectation, 
ar{S) = X^iiGV 5]]t=o ^^(^' Hence, the focus is to compute ap{u,t)^ for which we have the following 
theorem. 

Theorem 5. Given any u in arborescence A, and any t G [0,r]^ the activation probability ap{u^t) can be 
recursively computed as follows. 

For base cases when u G S or t = 0, 

( 1 {ueSAt = 0) 

ap{u,t)=< {u^ S At = 0) (1) 

[ (i/G^AtG {!,..., r}) 

For u ^ S At e {1,. . . ,r}, 

t-2 

ap{u, t)= II (l - ^ ap{w, t') p{w, u) [1 - (1 - m{w, u))'-''-^]) 

- n - E ^) [1 - (1 - ^(^' ^)Y~'']) ^ (2) 

where J^^^{u) C Va is the set of in-neighbors of u in A. 

Proof The base cases {u e S or t = 0) are trivial. When u ^ S and t G {1, . . . ,r}, for any in-neighbor 
w ^ Va of u and f < t, p{w,u)[l — (1 — m{w,u)y~^ is the probability that w meets u at least once 
from + 1 to t — 1 and that {w^ u) is live. Since the events that w gets activated at different t' are mutually 
exclusive, 1 — Xlt'^o ^P{^^ t^)p{^i ^) (l ~ (1 ~ m{w^ u)y~^ is the probability that u has not been activated 
by w before or at t — 1. Note that ^~,^Qap{w^t^)p{w^u)(^l — (1 — m{w , u)Y~^' = 0, so the above still 

holds for t = 1. Similarly, n^GA^^^Cn) (-'- ~ Y^t~^o <^P{'^^t^)p{'^^'^){^ ~ (1 ~ m{w^u)Y~^')) is the probability 
that u has not become active before or at t. Hence, Formula [2] is exactly the probability that u is activated 
at which is ap{u,t). □ 

The recursion given by Formula [2] can be carried out by dynamic programming, traversing from leaves 
to the root. In a general in-arborescence, given as input a node u and a deadline constraint r, the time 
complexity of calculating X^^^q t) by Formula [2] is polynomial to r, which is exponential to the size 
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of the input: 0(logr) bits. In principle, this does not affect efficiency much as r is small (5 or 10), and in 
general much smaller than the size of the graph. 

To reduce the amount of computations, a few optimizations can be applied in implementation. Let 
path{u) be the path from some s G S in A to u that has the minimum length among all such paths. Note 
that we only need to compute ap{u,t) for t G {\path{u)\^ . . . ,r}, as u cannot be reached earlier than step 
\path{u)\. That is, ap{u,t) = when t < \path{u)\. Also, if path{u) = (i.e., does not exist), ap{u^t) = 0, Vt. 

For computing ap{u, t) on a chain of nodes within an in-arborescence, we derive a more efficient method 
that reduces the computation to polynomial to logr, as shown in the next section. 

4.1 Fast Influence Computation on Chain Graphs 

Let rG {!,..., We consider a length-r (directed) chain graph L = (Vl^ ^l), where Vl = {uq^ . . . ^Ur} 
and El = {{ui, 1x^+1) : i G {0, . . . r - 1}}: 

Uq > Ui > . . . > Ur-l > Ur • 

Here, L can be thought of as a "sub-arborescense" of an in-arborescence A. The problem is to compute 
ap{u^t)^ given a step t G {0, . . . ,r}, for any node u on this chain. Notice that for any particular node li, 
what really matters in our analysis is the distance, i.e., the length of the path from the closest effective seed 
to u. Suppose that there is another seed Ug 7^ uq on the chain, where s < r. Nodes . . . ^Ug-i will only 
be possibly influenced by i^o (which is the effective seed for them), due to the structure of the chain, while 
nodes i^s+i, . . . , i^r will only be possibly influenced by Us (the effective seed), since the influence from and 
any node before Us will be blocked by Us- Therefore, without loss of generality, we assume that the seed set 
5* is a singleton set {uq} and the sole seed uq is activated at step 0, i.e., ap{uQ^{)) = 1 and apiu^t) = for 
any t > 1. 

Denote by =^ m{ui-i^Ui) the meeting probability and by pi ^ p{ui-i^Ui) the influence probability 
between Ui-i and Ui^ respectively, for alH G {1, . . . , r}. The activation of node u at step t can be character- 
ized by a probability distribution over all possible t, taking all relevant meeting probabilities and influence 
probabilities into consideration. 

For alH G {1, . . . , r}, let be the random variable indicating the number of steps needed for node Ui-i 
to meet Ui for the first time, given meeting probability m^. Note that Xi is a geometric random variable 
with parameter m^, and taking values from {1,2,3, . . .}. Suppose that the path from uq to u is of length 
i. Let X = Yli=i^i' The following lemma links the activation probability ap(u^t) with the probability 
distribution of X. 

Lemma 2. The activation probability of node u at time t is given as 

apiu,t)= (jlPi^ ■Pv[X = t]. 

Proof. The event that u is activated at time t is equivalent to that for all i G {1, . . . ,-^}, node Ui-i meets 
Ui and activates Ui upon their first meeting, and the first meeting of U£-i and U£ after U£-i is activated 
occurs at time t. Since X = Yli=i^i^ Pi'[X = t] is the probability that uq meets ui^ and then ui meets 
7x2, and so on, and U£-i meets U£^ and the total number of steps taken until U£-i meets ui is t. Y[i=iPi 
the probability that all activation attempts are successful upon first meetings, conditioned on the event that 
meetings of Ui-i and Ui for all i G {!,...,£} occur. Therefore, it is clear that (^fli^iPi) ' ^^[^ = t] is the 
probability that u = ui is activated at time t. □ 

With Lemma |2j the key to compute activation probability ap{u^ t) is to compute the probability distri- 
bution of X, the sum of ^ geometric random variables. When all geometric random variables have the same 
parameter, X is the well-known negative binomial random variable. We restate the result on the distribution 
of negative binomial random variable below. For completeness, a proof is given in the appendix. 
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Lemma 3 (Negative Binomial Distribution). Let X = Xli^i^i; where each Xi is a geometric random 
variable with parameter m and range {1,2, 3,...}. Then we have Ft[X = t] =0 for t < £, and for all t > 



Ft[X = t]=m^ ■ (l-m) 



t-i 



t-l 
£-1 



For the summation of geometric random variables with different parameters, the result is more complex. 
We cannot find the result from literature for the general case, so we provide our own analysis for the case 
that all parameters of geometric random variables are distinct, as shown below. 

Lemma 4. Let X = Yli=i where each Xi is a geometric random variable with parameter rrii and range 
{1, 2, 3, . . .}. Suppose that all rui ^s are distinct. Then we have Pr[X = =0 for t < £, and for all t>£, 

(l-m,)*-i 



Pr[X = *] = (n-.)-E7/ 

i=l i=l il7 = l 



We defer the proof of Lemma [4] to the appendix. With the above lemmas, we can derive the following 
theorem for the computation of activation probabilities for the chain graphs. 

Theorem 6. Let L = (Vl^El) be a directed length-r chain and let {uq} be the seed set. Let u be a node in 
the graph with distance £ from node uq. For the activation probability ap{u,t) of node u at time t, we have 
ap{u,t) = ift<£, and (a) if all meeting probabilities rrii 's of the edges are distinct, then 

ap{u,t) = [YlmiPij -^--^ ^ -; (3) 

(b) if all meeting probabilities rrii equal to a value m, then 

apiu, t) = m'-il- my-' • ' I J) • (4) 

Computing activation probabilities by Theorem [6] has time complexity polynomial to log r for t < r. 
Moreover, if we want to compute the cumulative probability J2t=o (ip{u,t), for Equation ^ it is easy to see 

that the key computation is Xlt=£(l — ^iY~^ = ^"'""^'^ m"^"*^"^'^ ^ which can be done in time polynomial 
to logr; for Equation Q, it is well known that the cumulative distribution of a negative binomial random 
variable can be computed via the regularized incomplete beta function, which can also be done in time 
polynomial to logr. 

Although chain graphs seem to be a rather restricted version of arborescences, in actual computations, 
since the size of the seed set is typically much smaller than the size of the original graph, for any node 
the number of seeds in this particular v's in-arborescence is usually quite small, and hence the chain cases 
will be common. In view of the above, this can help reduce the running time of our dynamic programming 
algorithm. An interesting open problem is to design an algorithm that computes infiuence spread in general 
in- arborescences with running time polynomial in logr. 



5 MIA Algorithms for IC-M 

The aforementioned greedy approximation algorithm is too inefficient to use in practice as it lacks of a way to 
efficiently compute inffuence spread in general graphs (Sec.[2|. To circumvent such inefficiency, we propose 
two MIA-based heuristic algorithms. The first algorithm is MIA-M (Maximum Infiuence Arborescence for IC- 
M) which uses the dynamic programming in Theorem [s] to compute exact infiuence of seeds. The second one 
is MIA-C (Maximum Infiuence Arborescence with Converted propagation probabilities) which first estimates 
propagation probabilities for pairwise users by combining meeting events, infiuence events, and the deadline 
r, and then uses MIA for IC to select seeds. 

Both algorithms first construct a maximum infiuence in-arborescence (MIIA) for each node in the graph, 
we calculate infiuence propagated through these MIIAs to approximate the infiuence in the original network. 
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5.1 The MIA-M Algorithm 

Before describing the algorithm, we first introduce some necessary notations. For a pair of nodes li, let 
V{u^v) be the set of all paths from to in G. Given a path P = {u = ui^ . . . ^ui = v) G Viu^v)^ its 
propagation probability 

Next, we define the maximum influence path from to to be 

VA\P{u^v) = argmaxpp(P). 

Note that W\\P{u^v) = 9 if u = v or V{u,v) = 0. In addition, we require at most one W\\P{u^v) for each u,v 
pair, with ties broken in a consistent way. To compute MIPs, notice that if we transfer infiuence probability 
p{u^v) into edge weight — logp{u^v), computing M\P{u^v) is equivalent to finding the shortest path from u 
to V in G, and this can be done efficiently by Dijkstra's algorithm. 

For MIA-M, we also introduce the "augmented" length ^a(P) of a path P to take meeting events and the 
deadline constraint into account. Consider an edge {ui^Uj) G P. Due to random meeting events, after Ui 
activates at step its infiuence will not propagate to Uj exactly at t-\-l. Instead, the propagation may take 
multiple steps and the number of such steps is a random variable Xij^ which can also be interpreted as the 
number of Bernoulli trials needed to get the first meeting between Ui and Uj after ix^'s activation (See also 
Sec 4.1). Clearly, Xij follows the geometric distribution, with success probability miui^Uj)^ expectation 



—r^ — and standard deviation V^^^i^^A^ Here we propose to estimate the value of Xn j by 

1 - m(ui,Uj) 



and define the augmented path length ^a(P) of P to be the sum of all estimated values of the random variables 
(one per edge) along P: 

1 - m{ui,Uj)\ 



{ui,Uj)eP 



m{ui^Uj) m(ui^Uj) 
We empirically verify that this is a good choice for ^a(P)- 



Constructing Arborescences 

For any node v in G, we approximate the infiuence to v from dl\ u \ {v} using the maximum influence 
in- arborescences (Mil A ) of v. To construct the MIIA rooted at we first take the union over the maximum 
infiuence paths to v over dl\ u \ {v}. After that, two pruning steps will be done. First, we remove paths 
whose propagation probability is below a pre-defined infiuence threshold G (0, 1], which controls the size 
of the local infiuence region and is a trade-off between efficiency and seed set quality. Second, to take the 
effect of deadline into account, we eliminate paths whose augmented length is greater than r. 

Definition 1 (Maximum Inffuence In-Arborescence). Given an influence threshold 9 G (0, 1] and a deadline 
constraint r G the maximum influence in-arborescence of any node v G V is 

M\\Ar{v,0) = Uuev,ppiM\Piu,v))>e/Ai^\Piu,v))<T^^^{u,v) 

The full MIA-M is described in Algorithm [2) where MG{u) = crr{S U {u}) - (Tr{S) is the marginal 
influence of u w.r.t to seed set MG{u, v) is the marginal influence of on a speciflc and realized{v) is 
the cumulative influence realized on v by 5*. Also, for each u gV^ InfSet{u) = {v G V : u G W\\\At-{v^O)}. 

After constructing W\\\Ar{v,0) and using Theorem [s] to obtain cr^({v}) for all v G V (lines 4-10), the 
algorithm selects k seeds iteratively in a greedy manner, and uses Theorem [5] to update the marginal gain of 
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Algorithm 2: MIA-M (G = {V,E), k, 6>, r) 



1 5^0; 

2 \/v eV, MG{v) ^ and realized{v) ^ 0; 

3 yv eV, MIIA^(^;,6') ^ and InfSet(v) ^ 0; 

4 foreach G V do 
Compute W\\\Ar{v,0) (Definition [l]) ; 
foreach u G MIIAr (^;, ^) do 

InfSet{u) ^ InfSet{u) U {v}] 

Compute ap{v,t, {n}, MIIA^(i;, 6*)), Vt < r (Theorem jsl ; 
^^^(^,^7) ^ J2l^QMy.t,{u},M\\Ar{v)); 
MG{u) ^ MG{u) + MG{u, v)] 



for i 

u 



1 ^ /c do 

argmax^g^\5 MG'(^;); 
5'^ 5'U{iz}; 

foreach G InfSet(u) do 
realized(v) += MG{u,v); 
foreach i(; G MIIA-r(^;, ^) do 

Compute ap(i;,t, 5' U{'u;},MIIA^(i;, 6*)), Vt < r (TheoremjHj); 
MGnew{w,v) ^ [J2l^Qap{v,t,SU{w},W\\\Ar{v,0))] - realized(v) 
MG{w) ^ MG{w) + MGnew{w,v) - MG{w,v); 
MG{w,v) ^ MGnew{w,v); 



nodes in related MIIAs (lines 11-20). Specifically, after u is picked as a seed, the activation probability of all 
V G InfSet{u) goes up, and thus we need to update the marginal gain of all w G MIIAt-('u), \/v G InfSet{u). 
Time Complexity: Let rime = max^^y |MIIAt-('u, 6>)|, and rise = max^^y \Inf Set{v)\. Also suppose that 
the maximum running time to compute VA\\/K^{y^9) for any v ^ V hy Dijkstra's algorithm is tj^e- Thus, 
MIA-M runs in 0(\V\{tme + rimer'^) + krimenseirimer + log |V|)). 

5.2 The MIA-C Algorithm 

We now discuss our second algorithm, MIA with Converted propagation probability (MIA-C). It consists 
of two steps. First, for each (u^v) G we estimate a converted propagation probability Pc{u,v) that 
incorporates meeting probability m{u^ v), influence probability p{u^ v), and deadline r, with the intention to 
simulate the influence spread under the IC-M model in the original IC model. Second, after obtaining all 
Pc{u,v), we treat these converted probabilities as parameters for the IC model and run the MIA algorithm 
proposed for IC to select k seeds. 

In the IC-M model with deadline r, the value of Pc{u, v) depends on p{u^ v), m{u, v), and r. We use the 
following conversion function to obtain Pciu^v): 

Pc(u, v) = p(u, v)-[l-{l- m(u, v)Y], (5) 

where {3 G [l,r] is the parameter used to estimate the number of meeting attempts. If j3 is 1, Pc{u,v) = 
p{u^ v)-m{u^ v)^ in which case we are pessimistic that u has only one chance to meet v (the minimum possible, 
assuming u itself activates before r). On the other hand, if P is r, pdu^v) = piu^v) • [1 — (1 — m{u^v)Y]^ 
for which we are optimistic that u has r chances to meet v (the maximum possible). To achieve a balanced 
heuristic, we let /3 = ^ for all pairs oiu^v^ and experiments show that this estimation turns out to be more 
effective than other choices in most cases. 

After the probability conversion step, we utilize MIA (Algorithm 4, [T) to find the seed set, making 
MIA-C take the advantage of updating marginal gains of nodes in an extremely efficient manner. 

The time complexity of converting probabilities is 0(\E\)^ and second part of MIA-C has the same time 
complexity as MIA, which is 0(\V\tme + krime^seirime + log |V|)). 
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6 LDAG Algorithms for LT-M 



In this section, we present an dynamic programming algorithm that computes exact influence spread in 
directed acyclic graphs (DAGs), and use it to develop our heuristic solution based on DAGs. Chen et al. [4] 
proved that computing exact influence spread in general graphs for any node set is #P-hard . Now we show 
that in directed acyclic graphs, computing influence can be done in time linear to the size of the graph. 

6.1 Fast Influence Computation in DAGs 

Consider an acyclic subgraph D = (V^E). Let S C V he the seed set. For any v gV^ let apoiu^t \ S) be 
the activation probability of u and step t, that is, the probability that u is activated right at step t in DAG 
D under the LT-M model, given the seed set S. When the notations are clear from the context, we write 
ap{v,t) for short. By definition, ap{v^t) = 1 if v e S A t = 0. Similarly, ap{v,t) = if v e S A t > 0^ or 
V ^ S At = 0. These cases form the basis for the recursion that we will develop for the dynamic programming 
algorithm, which computes ap{v^ t) when v ^ S At > {). 

The following theorem shows the important linear property of activation probability in a DAG. The proof 
for this theorem also requires Lemma [T] 

Theorem 7. For any v \ S, and any t G [1, r], the activation probability of v at t is 

t-i 

ap{y^t)= ^ h{u^v)^^^^ap{u^t') ■ m{u^v) ■ {1 — m{u^v)y~^ ~^ . (6) 

Proof. By Lemma [l] the event that a certain node v becomes activate at a certain step t has the same 
probability to happen under the LT-M model and the LE-M model. Let Su^v be the event that v selects 
edge {u^v) as a live edge. By the protocol of the live-edge selection process, we have Pr[^^^^] = b{u^v). Let 
R{v, t) be the event that v is reached by 5* at step t. Then, for dl\ v ^V\S^ 

ap{v,t) = Pr[fn,.] •Pr[i^(^,t) | ^n,.]- (7) 

uev\{v} 

In order to activate v at t, the selected u must be already active before or at t — 1. Furthermore, the 
events that u activates at possible steps 0, 1, . . . are mutually exclusive. Hence, the probability that event 
R{v^t) happens conditioned on Su,v is 

t-i 

Ft[R{v, t) I £uA = ' ^) • (1 ~ ^(^' ^)Y~^'~^' (8) 

Putting Eq. ^ and (|8| together, we obtain Eq. ([6|. This completes the proof. □ 

A directed application of Theorem [7] gives us a dynamic programming method to compute activation 
probabilities. 

6.2 The LDAG-M Heuristic Algorithm 

In this subsection, we propose a DAG-based heuristic algorithm, called LDAG-M, which leverages the pro- 
posed dynamic programming approach to compute influence efficiently. The algorithm first constructs a 
local directed acyclic graph (LDAG) for each node v in the graph. These LDAGs are small subgraphs of 
the original network. Then, LDAG-M uses influence propagated through these LDAGs to approximate the 
influence propagated in the original network. 
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Algorithm 3: Find-LDAG (G = {V,E), v, A) 



1 ^ 0; F ^ 0; 

2 foreach u G V \ {v} do 

3 I lnf{v,{u})^0; 

4 Inf{v,{v}) ^ 1; 

5 while max^^y\f/ Inf{v^ {u}) > A do 

6 w ^ argmax^^y^f; Inf{v, {u}); 

7 F ^ FU{{w,x) : X eU}; 

8 ^ UU{w}; 

9 foreach u G M'^^(w) do 

10 L {^i}) += h{u, w) • /n/(v, {w})] 

11 Li:)AG(v, A) ^ (/7,F) and output it; 



Local DAG Construction 

Let Inf^ {v^{u}) be the probability that v gets activated in G when u is the only seed. Similarly, let 
Inf^{v, {u}) be such probability in a DAG D. Given an influence threshold A, a node we wish to find 
a DAG D = {U,F) such that v G /7 C V, F C and Inf^{v,{u}) > A,Vix G U. There are possibly 
more than one DAGs satisfying the above conditions, but ideally, we want to find an optimal DAG D* such 
that J2ueu* -^^f^ i^A^}) largest among all possible choices. Unfortunately, this is NP-hard [4]. 

Therefore, we construct LDAGs heuritically in a greedy fashion. 

We first run the Find-LDAG algorithm due to [4^ (Algorithm |3|. It starts from an empty node set U and 
an empty edge set F. All Inf{v, {u}) are initialized to zero, except that Inf{v^ {u}) = 1. It repeatedly picks 
the node w that has the largest infiuence value to and add edges from w to existing nodes in U into F. 
Then it adds w into /7, and updates the infiuence values of all of w^s in- neighbors. Find-LDAG terminates 
when no new node has infiuence value of at least A. 

This algorithm suffices for the original infiuence maximization problem under the LT model. However, 
in our case, it is very important for us to take meeting events and the deadline constraint r into account. 
To this end, we need to post-process the LDAGs find by Algorithm [sj First, for each edge {u^w) G F, we 

obtain transfer the meeting probability m(u,w) into a distance weight —A — v — m(n w )^ Then, for each 
u G U\{v}, we compute its shortest path to v using Dijkstra's algorithm, and let this path be P(ii, v). Next, 
we calculate augmented path length of P(u,v), which is ^a(P) = T^r., ^^pr., ( — — r — m(n^ ^nj)y 
The pruning condition is that if ^a(P) > then node u and all corresponding edges are removed from 
LDAGiy^ A). Notice that for a DAG, we can do a topological sort on all nodes, so the order of pruning can 
be determined by that, with the v sorted first. This process is refiected in line 4 of Algorithm |4] 

Seed Set Selection and Full LDAG-M Algorithm 

The full LDAG-M algorithm first uses the approach described above to find out a suitable local DAG for all 
nodes in the graph (lines 3-4), then it greedily select at most k seeds, given the input G = {V^E) and the 
budget k. The pseudo-code of LDAG-M is given in Algorithm [4j For each node we also maintain a data 
structure called InfSetiu) := {v ^ V : u ^ LDAGiv^X)}^ a value MG{u) which denotes the incremental 
infiuence by adding u to the current seet set and a value realized{v) denotes the cumulative infiuence 
realized on v by the seed set 5*. After constructing LDAGs and InfSets, we use Theorem [7| to compute obtain 
ar{{u} for all u in the graph (lines 7-9). Line 10-19 iteratively pick k seeds in a greedy manner. After 
selecting a new seed we need to update the incremental infiuence of all w in LDAG{v^ A), Vi' G InfSet{u). 
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Algorithm 4: LDAG-M (G = {V,E), k, A) 



5 ^ 0; G V : MG{v) ^ 0; realized(v) ^ 0; 
foreach v do 

Cow^nte LDAG(v,\) (Algorithm [3|; 
Post-process (prune) LDAG{v^ X); 
foreach u G LDAG{v^ A) do 
InfSet{u) ^ v; 

Compute ap{v^t\{u})^Wt < r (Theorem m); 

MG{u,v)^Zl=oMv.t\My. 
MG{u) += MG{u,v)] 



10 for i = 1 ^ k do 



11 

12 
13 
14 
15 
16 
17 
18 
19 



u ^ argmax^^y\^ MG{v); 
S ^SU{x}; 

foreach v G InfSet{u) do 
realized{v) += MG{u,v); 
foreach w G LDAG{v) do 

Compute ap{v,t\S U {^}), Vt < r (Theorem[7|; 

MGnewi^j ^) ^ ^t=o (^P{^-> ^ {'^D ~ reaHzed{v); 
MG{w) += MGne^(^,^) - MG{w,v); 

MGIw.v) ^ MGnewiw.v); 



Dataset 


NetHEPT 


WikiVote 


Epinions 


DBLP 


Number of nodes 


15K 


7.1K 


75K 


655K 


Number of edges 


62K 


lOlK 


509K 


2.0M 


Average degree 


4.12 


26.6 


13.4 


6.1 


Maximum degree 


64 


1065 


3079 


588 


^Connected components 


1781 


24 


11 


73K 


Largest component size 


6794 


7066 


76K 


517K 


Average component size 


8.55 


296.5 


6.9K 


9.0 



Table 1: Statistics of Real- world Networks. 



7 Empirical Evaluations 

We conduct experiments on four real- world datasets to evaluate MIA-M and MIA-C, and compare them to 
a few other algorithms in terms of seed set quality and running time. All experiments are conducted on a 
server running Microsoft Windows Server 2008 R2 with 2.33GHz Quad-Core Intel Xeon E5410 CPU and 
32 G memory. 

7.1 Experiment Setup 

Dataset Preparation. The statistics of the datasets are summarized in Table [l] NetHEPT is a standard 
dataset in this area: It is a collaboration network extracted from the High Energy Physics Theory section 
(1991 to 2003) of the arXiv e-print repository (http://www.arxiv.org/). The network data is publicly avail- 
able at http:/ /research. microsoft.com/en-us/people/weic/projects.aspx. DBLP (http:/ /www.informatik.uni- 
trier.de/ley/db/) is a much larger collaboration network from the DBLP computer science bibliography server 
maintained by Michael Ley. Nodes in both datasets represent authors, and if u and v collaborated at least 
once, we draw direct arcs (u^v) and {v^u). Note that edges in the NetHEPT and DBLP graphs may carry 
multiplicity greater than 1, because two authors might co-author more than one papers. 
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WikiVote is a who- votes-on- whom network extracted from Wikipedia, the free onhne encyclopedia where 
users can interact with each other when they co-edit the same entries. If v voted on u for promoting u to 
adminship, we draw a directed arc {u^ v) to reflect ^x's influence on v. Epinions is a who-trusts-whom social 
network from the Epinions consumer review site (http:/ /www. epinions. com/). We draw a directed arc {u^ v) 
if V expressed her trust in i^'s reviews explicitly on the Website. Both the WikiVote and Epinions network data 
are can be obtained from the Stanford Network Analysis Project Website (http:/ /snap. stanford.edu/data/). 

Graph Parameters (Models for Assigning Influence and Meeting Probabilities). Influence prob- 
abilities are assigned using the Weighted Cascade (WC) model proposed in Kempe et al. [10 : For WikiV- 
ote and Epinions, p{u,v) = l/(P^{v) where d^^{v) is the in-degree of v. For NetHEPT and DBLP, 
p{u^v) = A{u,v)/A{v) where A{u^v) is the number of papers in which u and v were co-authors, and A{v) 
is the number of papers that v published in total. We also do experiments on the Trivalency (TV) model 
proposed in Chen et al. [2]: On every edge (u^v)^ we choose its edge probability uniformly at random from 
the set {0.001,0.01,0.1}. 

For meeting probabilities, it is reasonable to deem that the more friends an individual u has, the smaller 
the chance that u could meet a certain friend is. Therefore, we assign each edge (i^, v) its meeting proba- 
bility m(ii, v) = ^ont^^^)^g , where d^^^ is the out-degree of u and c is a constant chosen to be 5 here. In 
addition, we test on cases where for every edge {u^v), m{u^v) is chosen uniformly at random from the set 
{0.2,0.3,0.4,0.5,0.6,0.7,0.8}. 

Algorithms Compared. We evaluate MIA-M, MIA-C, the greedy algorithm (Greedy) and the other two: 
Degree and MIA. For Greedy, we apply CELF and run Monte Carlo for 10000 times as in Kempe et al. [10]. 
Degree is a heuristic based on the notion of degree centrality that considers high degree nodes influential. 
It outputs the top-/c highest out-degree nodes as seeds (Kempe et al. [10^). We then test MIA, one of the 
state-of-the-art heuristic algorithms for the standard IC model. For the purpose of comparisons, we let MIA 
select seeds disregarding meeting probabilities and the deadline constraint entirely, i.e., treating m{u^v) = 1 
for all edges and r = \V\. We also test on the Prefix-excluding MIA (PMIA) algorithm, which is a variant 
of MIA [2]. Since the results are similar, we omit it here. MIA-M, MIA-C, and MIA ah use 1/320 as the 
infiuence threshold ^, as recommended by Chen et al. [2 . For MIA-C, we choose P = ^ since it gives more 
stable performance (compared to 1 and r) in most cases. 

7.2 Results and Analysis 

We compare the five algorithms on quality of seeds sets and running time. The deadline r is set to 5 
(relatively short time horizon) and 15 (relatively long time horizon) in all results reported. Greedy is too 
slow to finish on Epinions and DBLP within a reasonable amount of time (3 days). 

Quality of Seed Sets. The quality of seed sets is evaluated based on the expected infiuence spread 
achieved. To ensure fair and accurate comparisons, we run MC simulations 10000 times to get the "ground 
truth" infiuence spread of all seed sets obtained by various algorithms. Fig. [T] and [2] illustrate infiuence 
spread achieved on datasets with weighted and uniform random meeting probabilities, respectively. 

On graphs with weighted meeting probabilities, except for Greedy, MIA-M has the highest seed set 
quality, while MIA-C is the second best in most test cases. MIA-M performs consistently better than Degree 
and MIA, e.g., on Epinions, the infiuence of 50 seeds by MIA-M is 99.4% (r = 5) and 53.6% (r = 15) higher 
than those by MIA. On NetHEPT and WikiVote, MIA-M produces seed sets with equally good quality as 
Greedy does, e.g., on WikiVote, when r = 5 they both achieve infiuence spread of 101; when r = 15, MIA-M 
(181) even achieves 3% higher than Greedy (175). 

When meeting probabilities are assigned uniformly at random, seed sets by MIA-M, MIA-C, and MIA 
tend to have matching infiuence, all being close to Greedy and better than Degree. Most often, MIA-M is 
marginally better than MIA-C and MIA. The reason why MIA catches up is that it picks seeds assuming all 
m(ii, v) = 1 and r = \V\^ and in expectation those seeds will still have high infiuence under uniform random 
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Figure 1: Influence spread (#nodes, Y-axis) against seed set size (X-axis) on graphs with weighted meeting proba- 
biUties. 
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Figure 2: Influence spread (#nodes, Y-axis) against seed set size (X-axis) on graphs with uniform random meeting 
probabilities. 
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Algorithm 


NetHEPT 


WikiVote 


Epinions 


DBLP 


5 


15 


5 


15 


5 


15 


5 


15 


Greedy 


40m 


1.3h 


22m 


28m 










MIA-M 


1.6s 


15s 


7.9s 


43s 


47s 


5.1m 


6.6m 


10m 


MIA-C 


0.3s 


0.3s 


0.4s 


0.5s 


2.7s 


3.3s 


24s 


33s 


MIA 


0.3s 


0.3s 


1.4s 


1.4s 


12s 


13s 


40s 


41s 



Table 2: Running Time (Weighted Meeting Probability) 



Algorithm 


NetHEPT 


WikiVote 


Epinions 


DBLP 
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15 
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15 
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15 
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15 


Greedy 


44m 


1.5h 


l.lh 


3.2h 










MIA-M 


3.8s 


21.4s 


28.7s 


2.5m 


3.3m 


12.4m 


7.3m 


14.3m 


MIA-C 


0.2s 


0.2s 


0.57s 


1.23s 


4.9s 


10.2 s 


32.5s 


45.6s 


MIA 


0.4s 


0.5s 


6.5s 


6.5s 


37.9s 


39.7s 


38.9s 


43.1s 



Table 3: Running Time fo Random Meeting Probabilities 



m(ii, v)'s, because the expectation is taken over the probability space of all meeting events. Also, when r is 
large, the time-critical effect of the deadline is diminishing, so MIA tends to performs better with large r. 

In reality, however, meeting probabilities between individuals in social networks may be quite different 
from being uniform random, and over all test cases it can be seen that MIA-M and MIA-C are more stable 
than MIA. For certain meeting probabilities, MIA has poor performance. 

Running Time. We demonstrate the running time results on weighted meeting probability datasets in 
Table [2] and the results on the uniform random cases in Table |3] Greedy takes 0.5 to 1.3 hours to finish on 
NetHEPT and WikiVote, and fails to complete in a reasonable amount of time (three days) on Epinions and 
DBLP with r = 5. Degree finishes almost instantly in all test cases so it is not included in the table. 

MIA-C and MIA are three orders of magnitude faster than Greedy, since both benefit from the linear- 
ity rule of activation probabilities when updating marginal gains [2 . MIA-C is more efficient because its 
converted probabilities are smaller than the original influence probabilities used in MIA, and hence arbores- 
cences are smaller for MIA-C under the same influence threshold (1/320). MIA-M is two orders of magnitude 
faster than Greedy, and is scalable to large graphs like Epinions and DBLP. It is slower than MIA-C and 
MIA because its dynamic programming procedure computes activation probabilities associated with steps, 
and hence is not compatible with the linearity rule of activation probabilities. 




Figure 3: Influence spread vs. meeting probabilities on NetHEPT. 
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Effects of Deadline Constraints and Meeting Probabilities. It can be seen from both Fig. [T] and [2] 

that as r increases from 5 to 15, seed sets by ah algorithms obtain higher influence spread, which is intuitive 
to see. We also test other values of r, such as 10 and 20, and since the trend is the same, to avoid densely 
clustered figures we do not include them here. For meeting probabilities, we conduct five test cases on 
NetHEPT, running Greedy with meeting probability 0.2,0.4,0.5,0.6,0.8 for each edge in the graph. The 
results in Fig. [3] show that as meeting probabilities increase, the influence spread of the seed set also go up. 

8 Conclusion and Discussions 

In this paper, we extend the IC and LT models, and their generalization, the Triggering Set model, to include 
time-delayed influence diffusion and we consider the time-critical influence maximization problem. We prove 
the submodularity of the influence function under these time-delayed models, and propose fast heuristics 
to solve the problem. There are a number of extensions and future directions on time-critical influence 
maximization. 

One extension is to use login probabilities to model time-delayed influence diffusion, which could fit 
better into online social networks. Specifically, each user has a probability of entering the system, and only 
after this action, the user could be influenced by her friends who are already activated. Incorporating these 
login probabilities into the current models turns out to be more challenging than incorporating meeting 
probabilities, because it introduces dependency in activation events. We have obtained partial results using 
more complicated dynamic programming methods to deal with this case. 
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Appendix 

A Analysis on the summation of geometric random variables 

In this appendix, we provide analysis of the distribution of summation of geometric random variables. We 
first provide a proof for the negative binomial distribution for completeness. 

Lemma [s] (Negative Binomial Distribution, re-stated). Let X = Yli=i'^i^ where each Xi is a 
geometric random variable with parameter m and range {1,2, 3,...}. Then we have Pr[X = =0 for t < 
and for alH > ^, 

Pr[X=t]=m^.(l-mr^Q:J). 



Proof. We prove the lemma by an induction on £. For the base case when ^ = 1, X is just a geometric 
random variable with parameter m, and thus Pr[X = t] = {1 — mY~^ m. For the induction step, assume 
that the lemma holds when ^ = 2: for some z. Let X' = ^^^i Xi and X = "^^^l X^. It is clear that when 
t < z + 1, Pr[X = t] = 0. Then for the case of i = z -\- 1 and t > 2; + 1, we have 



Pr[X = t] = Y^ Ft[X' = t'] • (1 - m) 



t'=z 

t-1 

= E 



• (1 - my 



t-t'-i 

t' - 1 

z-l 

t-1 



m 



(1 -m) 



t-t -1 



m 



,z+l 



t' -I 



• (1 -m) 



t-{z+l) 



t'^z 

't-1 

Z 



where for the second equation we have applied the induction hypothesis, and for the last one we have applied 
the property of binomial coefficients: Xlj=/c H) ~ (fc+i) • This completes the proof. □ 
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We now study the case where all geometric random variables X^'s have distinct parameters m^. Before 
proving Lemma [4j we first show the following technical lemma. 

Lemma 5. Suppose that ^ rrij whenever i ^ j. Then, 

Proof. Let =^ 1 — m^, then Formula (|9| (with i switched to z) becomes 

z ,Z-1 4.Z-1 



- {t, - • ii-j=,^^^,{tj - 1,) n|^,(t, - ■ 

Equivalent ly, we need to show the following holds: 



4-Z-l 



(10) 



For the base case of 2: = 1, it is clear that (10) holds. Suppose Formula (10) holds for some z > 1, now 
we show that it also holds for the case of z + 1. We first conduct the following manipulation: 

U • {tz-\-l — ^2+2) 







tz+2) 


.z-1 
^z+2 




-u) 






tz+2) 


- .z-1 
^z+2 




-U) 






tz+2) 


- .z-1 
^z+2 




-U) 


■ 


n|=i.,^.(i. - 


tz+2) 


.z-1 
^z+2 




-ti) 


jr' ■ 




tz+2) 


^z+2 




-ti) 




n|=i(tj - tz+2) 



tz-\-2 ' {tz-\-l ~ ti) , 
^2+2) ^2+2^2+1 ~ tz-\-2U — titz+1 + Utz-\-2 



tz-\-2 ■ {tz-\-l — ti) 

tz+1 ' {tz-\-2 ~ ti) 
tz+2 ' {tz+1 — ti) 



tl^l • {ti - tz+l) 

tr' 



tU2-{t^-tz+l) tl-\-^^,^^^^{t,-t,) 

tl+1 ' n|=i(t_^- - tz+2) ti ^ • ^j=ij^i{tj - tz+i) 



tl+2 ' n|=i(t, - tz+i) ti'l • u^,=,^,^,{tj - 1,) 

With the above, we have 



y-v tt ^ • ^j^ij^ijtj - tz+2) _ ^ tl • ^jtlj^ijtj - tz+2 

i=l tl^2 ' ^j = l,j^i(tj - ti) .^^ tl^2 ■ ^j = l,j^iitj - ti) 



+z . rtz+1 f+. _+\ ^ ^ 



^z+2 ' - it ^U2 ■ n,^;i,,/.(^. - 1^) tU2 ■ n|=i(t, - tz+l) 

_ tl^l • ^j = l{tj - tz+2) ^ t^ ^ • ^ j=ij^i{tj - tz+l) • Ilj = i{tj - tz+2) 

fZ-1 

■ 1 t/^l-l 



0, (12) 



tl+i • n|=i(tj - tz+2) ti^i ■ n|^i(tj- - tz+2) 



tz+2 • n|=i(ti - tz+l) • ^j=iitj - tz+l) 
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where (12) is obtained by applying the induction hypothesis. Also notice that in (11) 



based again on the induction hypothesis (with t^+i replaced by ^2+2)- 
Hence, we have 



= 1 ^^i=l,jViV'^J 



This implies that equation (10) holds for all 2: > 1, which completes the proof. □ 



We are now ready to prove Lemma [4j 



Lemma [4| (re-stated). Let X = Xl^^^ X^, where each Xi is a geometric random variable with parameter 
rrii and range {1,2, 3,...}. Suppose that all m^'s are distinct. Then we have Ft[X = t] =0 for t < and 
for all t > 

P.|X=,| = (n,n.).t „/'-7'''" ^ . 

Proof. We prove the lemma by an induction on the path length i. For the base case when ^ = 1, X = Xi, 
and thus Pr[X = t] = {1 — mi)^~^mi for all t > 1. For the induction step, suppose that the lemma is true 
for ^ = 2: for some z G Z+. Let X' = Xl^^^ Xi and X = X]i=i ^i- induction hypothesis, we have 

that for alH > 

(l-mi)*-i 



Now consider the case of £ = z -\- 1. It is clear that for t < z + 1, Pr[X = t] =0. For t > z + 1, the event 
{X = t} is the union of mutually exclusive events of {X' = t', X^+i = t — t'} for alH' = z + 1, . . . , t — 1. 
Thus we have 



Pr[X = t] = Y^ Pr[X^ = t'] • (1 - m,+i)*-*'-i • m,+i 



where we have applied the induction hypothesis for the last equality. 
Next, we switch the order of the two summations and get 



Pr[X=t]=m 

\i = l 
/2 + I 

= n 

\i=l 

= n 



rui 



{l-rrii) 
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i-2 t-1 / 



E 



(1 - m^+i)* 



(l-m,+i)*-*'-i 

1 — m, 



z-l 



t'-l 



f l — rrii \ f l — rrii \ 
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y 



f l-rrii ^ 
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(1 - vfiiY ^(1 - m^+i) 



t-z 



(1 - Vfli) 



t-\ 



= n 



m,- 



1 (mi - m^+i) ■ Uj=i,j^i{rnj - m) 



For the second summation term in ( |T3| , we need to prove the following equation: 

{1 - Tmr-Hl - m.+^y-' _ (l-m,+i)*-i 



(13) 



E 



(14) 



This can be done by dividing both sides of (14) by (1 — mz-\-iY ^ and applying Lemma [s] Then, we 
substitute (14) back into (13) and get 



z+l 



Pv[X = t]=(llmA-J2 



{l-miY 



Thus, the lemma holds for £ = z + 1, and this completes the proof. 



□ 
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